Implementation of generalized quantum measurements: superadditive quantum 
coding, accessible information extraction, and classical capacity limit 
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Quantum information theory predicts that when the transmission resource is doubled in quantum 
channels, the amount of information transmitted can be increased more than twice by quantum 
channel coding technique, whereas the increase is at most twice in classical information theory. This 
remarkable feature, the superadditive quantum coding gain, can be implemented by appropriate 
choices of code words and corresponding quantum decoding which requires a collective quantum 
measurement. Recently, the first experimental demonstration was reported [Phys. Rev. Lett. 90, 
167906 (2003)]. The purpose of this paper is to describe our experiment in detail. Particularly, 
a design strategy of quantum collective decoding in physical quantum circuits is emphasized. We 
also address the practical implication of the gain on communication performance by introducing the 
quantum- classical hybrid coding scheme. We show how the superadditive quantum coding gain, even 
in a small code length, can boost the communication performance of conventional coding technique. 

PACS numbers: 03.67.-a, 03.65.Ta, 89.70,+c 



I. INTRODUCTION 

It is a fundamental problem in information science 
what is the most efficient way of transmitting information 
with a minimum of transmission resources. The amount 
of information transmissible through a communications 
channel is determined by the noise characteristics of the 
channel and by the quantities of available transmission 
resources. In classical communication theory 0, 0, 0, 
the amount of transmissible information can be increased 
twice at most when the transmission resource (e.g. the 
code length, the signal power, the bandwidth) is dou- 
bled, for the fixed noise characteristics of the channel. In 
quantum communication theory, however, this is not true 
in general, that is, the amount of information transmit- 
ted can be increased even more than twice. This feature 
is called the superadditivity of the capacity of quantum 
channel EEE EE, 0. 

The superadditivity becomes essential in any trans- 
mission of signals at the quantum level where ambiguity 
among signals is a matter of non-commutativity of the 
density matrices, i.e. popi ^ pipo, rather than any clas- 
sical noises such as thermal noise. One typical example 
is deep space optical communications. Inter-satellite op- 
tical link is expected to achieve a high transmission rate 
that cannot be achieved by the radio- or micro- wave links, 
e.g. to realize data transmission from a space-telescope 
with billion pixels or real time communications over the 
planets. In a deep space optical link, the sender prepares 
coherent state signals with as large amplitude as possi- 
ble allowed by a limited power supply. Such signals are, 
however, extremely weakened at the receiving end, typi- 
cally less than a few photons per pulse, due to the beam 
divergence and energy loss. Since the energy quantum 
of carriers is greater than that of thermal noise in op- 
tical domain, i.e. hco > k^T, physical states of carriers 
can be described by pure quantum states in good ap- 



proximation. For example, the binary phase shift keyed 
signals to convey classical letters and 1, respectively, 
are represented by the coherent states po = \a)(a\ and 
pi = |— a|, respectively. For weak coherent pulses, 
the state overlap (a\—a) becomes no n- negligible, i.e. po 
and pi are noncommuting. According to the uncertainty 
principle, noncommuting density matrices can never be 
distinguished perfectly. This imposes an inevitable error 
in si gna l detection even in an ideal communications sys- 
tem Actually, when \a\ 2 < 3, po and pi can not be 
distinguished at a bit error rate less than 10 -6 , which is a 
typical error free criterion in deep space communications. 

Historically, an extension of communication theory 
into quantum domain including this aspect of ambigu- 
ity has been explored since 1960's [H 111 H 13 • In 
1973, Holevo derived the quantity that bounds the up- 
per limit of the capacity of a quantum communications 
channel |l^. It was recently shown that this so-called 
Holevo bound is an achievable rate, that is, the exact 
expression of the capacit y [TH Hfl IT^ . Classical com- 
munication theory 0, E E describes the special case 
where the signals are given by commuting density ma- 
trices. The distinctive characteristics of quantum theory 
of capacity is the great emphasis on quantum decoding 
process to extract information from block sequences of 
noncommuting density matrices. The essence of the op- 
timal decoding is the use of a process of entangling letter 
states constituting code words prior to measurement to 
enhance the distinguishability of signals. Such a process 
is a quantum computation on code word states. This so- 
called quantum collective decoding is a new aspect, not 
found in conventional coding techniques, and leads to a 
larger capacity. This is called the superadditive quantum 
coding gain (SQCG) EEEEEE m a quantum chan- 
nel since the length n quantum coding makes capacity 
more than n times larger from the capacity achievable 
only by conventional coding. Taking it into account, the 
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capacity is defined as the maximum rate of the mutual 
information for a (quantum) code of length n divided by 
the length n in the limit of n — » oo for asymptotic error 
free transmission. 

The theory of capacity, however, generally gives no 
guidance on how to construct codes that approach the 
capacity. A practical problem is then to find good codes 
to attain a large SQCG in a small block length. This 
must be an important issue in any communications and 
information-processing systems when they work at the 
quantum level, which is expected in a few decades consid- 
ering recent exponential growth of info-communication 
demands. However, little attention has been paid to this 
topic so far. Only several coding schemes have been 
proposed to exhibit SQCG 0, 0, Q, H and the first 
experimental demonstration has recently been reported 
by the authors The purpose of the present paper 

is to give detailed information that was abbreviated or 
omitted in our previous letter. An attention is particu- 
larly paid to describe the strategy how to implement the 
quantum measurements used in our experiments by log- 
ical and physical quantum circuits. We also describe the 
detailed discussion of the implications of SQCG in small 
code length on practical communication performances. 

The paper is organized as follows. In Sec. [HJ we re- 
mind readers of several capacities of quantum channels 
studied to date, and explain our scenario. In Sec. the 
basic notion for capacity theorem and SQCG are briefly 
explained. In Sec. II VI we discuss how we designed logi- 
cal and physical quantum circuits for our SQCG exper- 
iment, which was omitted in our previous letter. Sec- 
tion describes in detail our experiment of SQCG re- 
ported in Ref. 18]. We also show the experimental results 
about the separable quantum measurements attaining 
the single-shot capacity and the accessible information 
for comparison. In Sec. IVIl we discuss how SQCG, even 
the small gain demonstrated in length two coding, can 
boost a communication performance attained by conven- 
tional coding technique. The idea is based on quantum- 
classical hybrid coding (QCHC) which was briefly men- 
tioned in our previous letter |l8|. Theoretical details on 
the methodology of QCHC are given. Section IVTA is for 
concluding remarks. 



II. CAPACITIES FOR QUANTUM CHANNELS 

Since Shannon's capacity theory was extended into 
generic quantum states in Refs. 0, 0, ^J, the capac- 
ity theory is further extended to include new auxiliary 
resources of entangled particles, new quantum protocols, 
and a new object to be transmitted, i.e. intact quan- 
tum state [l9j. The notion of the capacity for quan- 
tum channels is now classified into two categories; 1) the 
classical capacity for transmitting conventional (classi- 
cal) alphabet, and 2) the quantum capacity for transmit- 
ting quantum alphabet (unknown quantum states). For 
both categories entanglement assisted protocols may be 



considered, namely superdense coding [2(J and quantum 
teleportation 21], respectively. Our concern is the first 
category, i.e. the classical capacity. 

Depending on whether additional entanglement re- 
sources are brought into play or not, the classical capacity 
is classified into two kinds, namely, the entanglement- 
assisted capacity Ce and the ordinary capacity C. The 
former is defined for a quantum channel with the help of 
unlimited prior entanglement sharing between the sender 
and the receiver |22| • The latter is defined for a quantum 
channel with the help of any allowed quantum opera- 
tions at the sender (quantum encoding) and the receiver 
(quantum decoding), but without any prior source shar- 
ing. Both schemes assume multiple uses of the channel, 
that is, coding, and the capacity is defined as the max- 
imum amount of transmissible information per channel 
use. It should be noted that shared entanglement is not 
regarded as the transmission resources in the definition 
of the entanglement-assisted capacity Ce- This type of 
classification can simplify the study of several distinct 
capacities and their relation including the quantum ca- 
pacities [22^ . 

From a practical point of view, on the other hand, it is 
not realistic to expect the assistance of unlimited exter- 
nal resources. To predict the highest transmission rate 
in realistic situations, all the physical entities used for 
transmission, such as the shared entanglement, must be 
included in the elements constituting a communications 
channel. In this situation, the ordinary capacity C is ap- 
propriate to evaluate the communication performance of 
the channel since it imposes the power constraint condi- 
tion of the total physical resources. 

Keeping such backgrounds in mind, we restrict our dis- 
cussion to the following protocol. The sender transmits 
classical alphabet in classically encoded format, i.e. in 
separable tensor product states (code word states) made 
of a given set of letter states, and these code word states 
are still separable at the receiving end. No prior entangle- 
ment is shared between the sender and the receiver. The 
receiver may apply any quantum operations to the re- 
ceived code word states. In fact, it is known that effective 
quantum coding is that the receiver entangles the letter 
states prior to detection. This is called quantum decod- 
ing and contributes to SQCG which is never observed in 
any classical coding. Such scenario is within the frame- 
work of the ordinary capacity C, and exactly the case 
that the capacity theories of Refs. [Dl HH E] concern. 
The reasons for choosing such protocol and for excluding 
quantum encoding and prior entanglement sharing are 
1) concrete quantum coding schemes are known only for 
such protocol at present, and 2) it fits better to practical 
motivations introduced in Sec. U since it does not neces- 
sarily require a transmission of nonclassical state signals. 
The main task in this paper is, therefore, the demonstra- 
tion of the quantum decoding process. 
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III. SUPERADDITIVE CODING GAIN 

A practical mean for effective communications is cod- 
ing, that is, representing alphabet by sequences of simple 
letters such as {0, 1}. Alphabets to be transmitted are 
represented by code words which are sequences of a given 
set of letters {xq, such as the binary set {0, 1}. 

The transmitter modulates a signal carrier into one of L 
states {po, Pl-i} according to the input letter. If the 
letter states {po, pL-i} appear as orthogonal states at 
the receiving end, then they can be distinguished per- 
fectly and log 2 L bits of information, which is the max- 
imum Shannon entropy of the set {xo, ...,#z,-i}, can be 
faithfully retrieved per letter. This is, however, not the 
case in general. A channel is usually subject to various 
types of noise disturbances. In order to transmit infor- 
mation reliably through a channel with finite errors, one 
must introduce some redundancy in code word represen- 
tation prior to transmission so as to allow the correction 
of errors at the receiving side. This entails adding some 
redundant letters to the code words and hence increases 
their length. This is channel coding. 

First, the source encoder converts the original mes- 
sage into a sequence of the letters in the given set 
{xq, ...,xl-i}, and then the channel encoder divides it 
into blocks of length k (message blocks). Each block is 
supplemented by an additional block (redundant block) 
of n—k (n>k) letters to compose a channel code word 



message block redundant block 



(0 (*) 

1 2 



(1) 



(for i = l,2,...,L fe 



Note that although there are L n possible sequences of 
length n in total, only part of them, i.e. L k sequences, 
are used as code words. This redundancy, together with 
appropriate encoding and decoding, allows us to recover 
possible errors in transmission. The amount of informa- 
tion conveyed by the above code words is K = k log 2 L 
bits. The transmission rate is then defined by R = 
K/n = (k/n)\og 2 L bits/letter. For a channel with a 
capacity C bits/letter, it is possible Q within the 
rate R = K/n < C to reproduce the K bits of messages 
with an error probability as small as desired by appro- 
priate encoding and decoding in the limit n — > oo. 

A mathematical model of a channel is specified by a set 
of possible outputs {y} from the channel and a channel 
matrix in which each matrix element is given by the con- 
ditional probability P(y\x) of having y given the input 
x. Each input letter x is used with a priori probability 
P(x). The probability of having y is then given by 



P{y) = Y^P{y\x)P{x). 



(2) 



To define the capacity, Shannon introduced the mu- 
tual information 



This is defined between the in- 



put variable X = {x;P(x)} and the output variable 
Y = {y;P(y)}as 



I(X :Y) = y £P(x) y £P(y\x)\og 

y 



P(y\x) 



^P(x')P(y\x') 



(3) 

In classical information theory, one considers coding 
for a given and fixed channel model {P(y\x)}. The de- 
coding error of code words {xi,...,x L fc} can be calcu- 
lated based on the probability distributions {P(x)} and 
{P(y\x)}. The capacity (for a memoryless channel) is de- 
fined as the maximum mutual information with respect 
to the prior distribution of the letters P(x), 



C = max I(X : Y) . 

{P(x)} 



(4) 



In the quantum context, however, only the input vari- 
able X and the corresponding set of quantum states at 
the receiver's hand denoted as {p x } are given. The out- 
put variable Y is to be sought for the best quantum mea- 
surement. A quantum measurement process can mathe- 
matically be described by a set of non-negative Hermitian 
operators {tl y } satisfying the probability conservation re- 
lation Ily = I, so-called the positive operator valued 
measure (POVM). The channel matrix is then given by 



P(y\x) = Tr (fl y p x ) , 



(5) 



and now one can define the maximum extractable infor- 
mation 



^Acc = max/(X : Y), 



(6) 



which is called the accessible information. More gener- 
ally, the quantity further maximized with respect to the 
prior probability 



C\ = max max/(X : Y). 

{P(x)} { u y} 



(7) 



specifies the classical limit of the capacity when the given 
initial channel {P(y\x)} is used with classical channel 
coding [2^1 . It is this quantity that limits the perfor- 
mance of all modern communications systems. This is, 
however, not the ultimate capacity allowed by quantum 
mechanics. 

The code words {x} are now conveyed by the quantum 
states in a tensor product of the letter states, \£ x = p Xl <g> 
• • • (8) p Xn . To decode them, one may design the best 
quantum measurement allowed by quantum mechanics. 
This is described by the POVM {Ily} on the extended 
space where {y} are decoded code words. The channel 
matrix for this extended channel is given by 



P(y|x) = Tr 



(8) 
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One may then define the mutual information for this ex- 
tended channel by 



/(X":r") = ^P(x)]TP(y|x)log 



Further one can define the quantity 



^(y|x) 



EP(x')P(y|x') 

(9) 



C n = max maxiYX™ : Y n ), 
™>{n y } 



(10) 



which we refer to the capacity of order n. The super ad- 
ditivity of quantum channel is then expressed as 



C n > nd. 



(11) 



The capacity of quantum channel as the maximum rate 
of error free transmission is defined by 



C= lim ^. 

n— >oo TL 



(12) 



The property of Eq. {TTJ was first predicted by Holevo 
based on the random coding technique Peres and 
Wootters conjectured that 



I(X 2 : Y 2 ) > 2max/(X : Y), 
{n y } 



(13) 



by using the ternary symmetric states of qubit Q. In 
these works, the importance of using quantum collective 
measurement on a block sequence of code word state was 
emphasized. The first rigorous example of the superad- 
ditivity was given by Sasaki et al. for the binary pure 
letter states |6j, where the quantum channel showing 



/(X 3 : Y 3 ) > 3Ci, 



(14) 



was explicitly demonstrated. Since then several examples 
of quantum code construction with the superadditivity 
were clarified HQ 0,0. 

The important observation of the superadditivity is 
that the property 



P(y\K)^P(y 1 \x 1 )..-P(y n \x n ), 



(15) 



generally holds when an appropriate collective POVM is 
chosen. This is a kind of memory effect of the extended 
channel. When the measurement is made by projection 
onto separable bases, such a memory effect never takes 
place. Projection onto appropriate entangled bases in- 
duces quantum interferences among the code word states 
to reduce the ambiguity among the signals. The memory 
effect is a direct consequence of this quantum interference 
of block codes, that is, exactly the effect of the entangle- 
ment. Realization of such a quantum collective decoding 
generally requires quantum computation to entangle the 
letter states 



quire 
, 24]. 



IV. MODEL FOR PROOF-OF-PRINCIPLE 
DEMONSTRATION: QUBIT TRINE 

Quantum collective decoding on quantum particles, 
that is, entangling quantum particles, is something very 
hard to realize at present even for two particles. In addi- 
tion, the gains predicted for short length codes are very 
small. Here we consider how one can demonstrate the 
principle of quantum collective decoding. We deal with 
the noiseless channel model in which only the noncom- 
mutativity of the signals causes the transmission error. 

The simplest set of letters is the binary set of pure 
states (l^o) ? |^i)} where the overlap between the letters 
is (^ol^i) = K - Only for this set, the classical capac- 
ity limit C\ is known with rigorous mathematical proof 
HE HI E3 • The very first step is the length two cod- 
ing. We have four possible tensor product sequences 
{|^o> l^o) , |^o) , l^i) l^o) , hM}- Buck et al i 
showed that, by choosing three of them as code word 
states, the channel exhibits the superadditivity depend- 
ing on the overlap k. Unfortunately, the predicted max- 
imum SQCG was only J 2 /2 — C\ = 5.2 x 10 -4 bits and 
it seems too small to be observed experimentally. In the 
length three coding |(|, the four code words were picked 
up from eight possible sequences so that the Hamming 
distance between each code word is equal [d = 2) to show 
the gain described in Eq. (|14|). In this case, the maximum 
gain was predicted to be 9 x 10 -3 bits. Although the gain 
is bigger than that in the length two coding, entangling 
three qubits for the quantum collective decoding requires 
more than ten steps of quantum gating, which seems to 
be difficult to realize. 

Therefore, we consider the second simplest case, the 
qubit trine signals. The qubit trine consists of the ternary 
symmetric letter states of a qubit {l^o) ? I^i) > IV^)}- It is 
this model which we use to demonstrate our experimen- 
tal steps toward SQCG in the next section. It should 
be noted that the dimensionality (2-dim. ) is essential 
here. If the ternary states are defined in a higher dimen- 
sional space than three, such as the lifted trine, we do not 
know the exact value of C\. In addition, according to our 
numerical studies, SQCG in terms of the mutual infor- 
mation appears smaller compared to that in the qubit 
trine case. The qubit trine is defined by the letter state 
set {|^o), |^2)} with 



l^o) 
l^i) 
1^2) 



|0>, 



(16a) 
(16b) 

(16c) 



where {|0), |1)} is the orthonormal basis set. They are 
represented in Fig.^Ja). 

The accessible information, defined by Eq. (0), for 
this set with equal prior probabilities is found to be 
^Acc = 0.5850 bits with a rigorous proof [28|. The opti- 
mal measurement strategy is described by the nonorthog- 
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l</>o> 
l"o> 



l^22> l^ll) 
\ \ 



(a) 



>i> 
(b) 



-re 

|n 22 > 
inn) 




l^oo> 
/ 



\ 

|n 00 > 



(c) 



FIG. 1: Geometrical representation of several sets of quan- 
tum state vectors and measurement vectors, (a) The ternary 
symmetric letter states (qubit trine), (b) The letter (dotted 
arrows) and the measurement (solid arrows) state vectors to 
attain the C\. (c) The code word (dotted arrows) and de- 
coding (solid arrows) state vectors represented in a real three 
dimensional space. 



onal basis set {|cjo)> 1^2 )} of 



\uj 2 ) 



- sin - 



7Ac 



|0) 



V2 



10) 



11), 

+ 71 cos ~ 

1 7ao 

—= cos 

V2 2 



|1>, 



11), 



where 7a cc is defined by 



7Acc , 7T 

cos = cot — . 

2 3 



(17a) 
(17b) 

(17c) 
(18) 



This is a typical example of generalized quantum mea- 
surement, and was demonstrated in laboratory for polar- 
ization qubit of a photon in Refs. [2^, HU- The func- 
tional meaning of this quantity /acc is as follows. If 
the receiver applies this detection separately on each let- 
ter states (separable decoding), and encoding is made 
such that each letter states occurs with equal probabili- 
ties in the set of code words, then the maximum trans- 
mission rate for error free transmission is Jacc = 0.5850 
bits/letter. 

One may further optimize the quantity besides the de- 
tection strategy. For the ternary set, C\ has been care- 
fully studied and evaluated to be 0.6454 bits [53, EI- 
This is attained by discarding one of the three letters 
and using only two of them, say {|^o) 7 l^i)}? with equal 
probability 1/2 and applying the measurement described 
by the ort honor mal basis, 



y/2 + y/3 
V3 



IV*>- 



y/2- yg . 
Wi) = -y| IW 



y/2- y/3 

^/3 
y/2 + V3 

Vs 1 



IV'i), 



This is schematically illustrated in Fig.HJb). 

Now we construct the length two coding. For the qubit 
trine letters, there are nine possible sequences. Peres and 
Wootters showed 5] that if one uses only three of them, 



which are 

\Vxx) = \*l>x)®\ll>x) 

= 1(1 + cos^)|0)|0) + \ sin^(|0)|l) + |1)|0) 
+ I(l-cos^)|l)|l), (20) 

where <p x = 2ttx/3 (x = 0, 1, 2), as the code word states 
with equal probability, and decodes them by the square- 
root measurement defined mathematically by 



\^yy)= I £ 



(21) 



then I(X 2 : Y 2 ) = 1.3690 bits of information can be 
retrieved in principle. This is larger than twice of C\ 
(= 0.6454). The SQCG is 7 2 /2 - d = 0.0391 which is 
expected to be accessible in laboratory. 

The measurement basis Eq. (|2l|) is explicitly written 

as 



in, 



00/ 



a I* 



00/ 



b I* 



00/ 



pll) 

|n 22 ) = b |^oo 



where 



a = 



-b |*n) + 6 |*22>, 
a |^ii) + 6 ^22), 
b l^n) + a ^22), 

+ y/2 



3V3 ' 
2-V2 
3V3 



(22a) 
(22b) 
(22c) 

(23a) 
(23b) 



These bases are entangled states and the measurement 
described by the POVM {| 11^) (11^1} is a typical ex- 
ample of quantum collective decoding. The ternay code 
word states {1^3/^)} can be described by the real vectors 
in a three dimensional space spanned by {|0)|0), |0)|1) + 
|1)|0), |1)|1)}, as seen from Eq. (|2U|) . This ternary set 
is called the lifted trine, and provides interesting in- 
sights in to q uantum measurement problems as discussed 
by Shor |3lL 132^. The measurement basis {JII^)} forms 
another orthonormal basis set in the three dimensional 
space. {I^^x)} and {|n y2/ )} are geometrically depicted in 
Fig. dc). The code words {l^a)} are distinguished by 
the projection to each of the nearest \H y y). The channel 
matrix [P(i/|x)] = | (n^ \^ xx ) | 2 is expressed as 



(24) 

(25a) 
(25b) 







cos 2 ^ 


1-27 

2 sin \ 


1 -271 

2 sin \ 


(19a) 


[P(y\x)} = 


2 sin \ 


cos 2 ^ 


1-27 

5 sin \ 


(19b) 


where 


. \ sin2 2 


\ sin2 2 


cos 2 ^ 



7 

cos — 
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sin ■ 



7 



V^+l 
a/6 ' 
V2-1 

~7T' 



6 



In this noiseless model the channel is essentially a mea- 
surement channel whose ambiguity is due to nonorthog- 
onality of the code word states. 

While the square root measurement is simply ex- 
pressed by the von Neumann measurement in the three 
dimensional real space in terms of Eqs. ((22)) . this math- 
ematical expression informs us of nothing special about 
physical implementations of the decoder. There may be 
many possible ways to realize effectively the measure- 
ment channel matrix of Eq. (|24|) . One of systematic and 
straightforward ways is to express the original measure- 
ment bases as simple separable bases plus additional uni- 
tary transformation, and to convert the unitary transfor- 
mation into a quantum circuit 0,0- Along this line, 
we derive a quantum circuit realizing (lll^)}. 

Let us rewrite the POVM {|n^) as 



|n 00 ) 

|Hii> 
\S) 

|n 22 ) 



#t| )|0>, 
£t| )|l>, 
tft|i)| ), 

^11)11), 



(26a) 
(26b) 
(26c) 
(26d) 



where \S) = (|0)|1) - \1)\0))/V^- The unitary operator 
U can be given by the matrix representation 



U 



COS 77 
1 • 7 

vl sin i 



1 • l 

7I sm 2 





i 

2 

1 
V2 
_ 1 
2 





1 

2 

1_ 

V2 
_ 1 
2 

the 



sin ^ 

1 7 

71 cos I 







V2 



cos ■ 



(27) 



with respect to the separable basis 
{|0)|0), |0)|1), |1)|0), |1)|1)}. Circuit construction 

for this unitary operator can be carried out in the fol- 
lowing way. With the help of the Gaussian elimination 
algorithm |3^, U can be decomposed into a product of 
U(2) operators Ty }i ] as 



U = Tr 



[2,1] T [3,1] 



' T[4,2] T[4,3]i 



(28) 



where Ty^j represents the two-dimensional rotation op- 
erators between the i-th and j-th basis vectors. Each 
operator Tj^j is then converted into a quantum circuit 
by using the formulae established by Barenco et al [34^ . 
The quantum circuit derived along this line consists of so 
many 2 bit basic gates, and is generally not in the min- 
imal form. We further compiled the circuit into a much 
simpler version in a heuristic way. The final and possibly 
the simplest quantum circuit for U is shown in Fig. O It 
consists of five controlled-unitary gates. 



0(-5) — fiy("7) 



0(f) 



-CH0> or |l> 
von Neumann 
measurement 

-D-|0> or |1) 



FIG. 2: Quantum circuit to realize the quantum collective 
decoding by the square root measurement {|n yy )}. A re- 
ceived code word state is first transformed by the five con- 
trolled gates, and then is detected by a standard von Neu- 
mann measurement on each letter separately. Nomenclature 
of the controlled gates is based on Ref. |34|. Q(<p) is the uni- 
tary operator defined as Q(<p) = R y ((p)a z . The open circle 
notation indicates conditioning on the 'control' qubit being 
set to zero. 



states. The quantum circuit of Fig. [2| then requires 
photon-photon gates. Although the principle of such 
photon-photon gates have been demonstrated experimen- 
tally |33 , its precision is still far below the level required 
to access the small superadditive coding gain. Even if we 
rely on other, not flying, qubit systems such as trapped 
ion or molecules in NMR for which quantum gating has 
been demonstrated to date, it seems still formidable to 
run a five-step gating operation with the required preci- 
sion. 

Therefore we consider the length two coding based on 
the two physically different kinds of qubit, namely, a po- 
larization and location qubits of a single photon. The 
first and second letter states of a code word are drawn 
from the ternary letter state sets of a polarization and a 
location qubits, {\^ x )p} and {1^)^}, respectively. Then 
the collective decoding can be realized by an optical cir- 
cuit consisting only of linear passive components, and 
a sufficiently high gating precision can be attained. In 
fact, by using the same polarization- location encoding 
format |3y, |37], |23, several quantum alg orithms have 
been demonstrated experimentally |39l l4(]j . 

In the following subsections, we first describe the phys- 
ical implementation based on the polarization-location 
format. Then we discuss three kinds of experiments 
on quantum measurement of the accessible information, 
the single-shot capacity, and SQCG attained by the sec- 
ond order mutual information. These schemes are on a 
structured scenario of the capacity theory as described 
in Sec. III. They also correspond to the most typical 
measurements for the same qubit trine in the framework 
of quantum measurement theory, i.e. the von Neumann 
measurement, the single-shot generalized measurement, 
and the collective measurement. 



V. IMPLEMENTATION 



A. Preparation of optical qubit states 



A favorable qubit trine is made of a flying qubit of 
photons. It would be natural to construct code words 
by the pairs of photons in the same linear polarization 



The polarization qubit consists of the horizontal |0)p = 
I if) and the vertical \l)p = \V) polarization states of a 
single photon and prepared by a half waveplate (HWP) 
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which acts as 



\H) 
\V) 



-cos29\H) + sin20|F), (29a) 
sin2<9|#) -f cos2<9|V), (29b) 



where is the angle of the fast axis from the vertical 
axis. The elements of the ternary set of polarization qubit 
\^o)p, \^i)p: \^2)p can be prepared from the input of the 
|0)p state by setting = 0, 7r/6, tt/3 [rad], respectively. 

The ternary set of location qubit {\^ x ) l\ can be pre- 
pared by guiding the polarization letter states into two 
optical paths through a polarizing beam splitter (PBS). 
It reflects the vertical polarization and transmits the hor- 
izontal polarization as 



\H)a® I vacuum) b 
\V)a < 8> I vacuum) b 



\H)a ® Ivacuum)^, (30a) 
zlvacuum)^ \ V)b, (30b) 



where A and B are the labels for the two different optical 
paths. 

The length two coding can be realized in the Hilbert 
space spanned by the orthonormal bases |3(J |37i |3^ , 

|00) = |O)p0|O) L = |^)A0|vacuum) B , (31a) 

|01) = |O) P 0|l) L = |vacuum) A 0|i^) jB , (31b) 

|10) = |l) P 0|O) L = |F) A 0|vacuum) B , (31c) 

|11) = |l)p0|l) L = |vacuum) A 0|F)p. (31d) 

In this space the encodings can be performed by the op- 
tical circuit shown in Fig. |3| which consists of a PBS and 
three HWPs. With an input photon initially in the state 
1 00), the output of this encoder is given by 

cos2<9 cos2<9i|00) - sin2<9 sin2<9 2 |01) 
- cos 2<9 sin 26 1 1 10) + sin 2(9 cos 2(9 2 1 1 1) , 

(32) 



I*) = 



where 0q, 9\ and $2 are the angles of the three HWPs. By 
controlling these angles appropriately, polarization qubit 

states \ip x )p <S> |0)l, location qubit states |0)p <S> \i/j x )l, 
and the length two code word states \ip x )p <8> \^x)l can 
be prepared. 

Thus in our coding format, doubling the transmission 
resource is realized by doubling the spatial resource in- 
stead of by using two polarized photons. From the view- 
point of communication theory, this can be regarded as 
a kind of pulse position coding which is often used when 
the signal power available is severely limited. 



B. Accessible information 

The POVM for the accessible information /acc gener- 
ally consists of overcomplete nonorthogonal states, which 
is a typical example of generalized measurement. It is 
well known that such a POVM can be implemented by 
a von Neumann measurement in an extended Hilbert 
space, which is called the Naimark extension 28]. The 



|vacuum) ■ 
\H)- 




FIG. 3: Optical circuit for polarization- location encoding. 
HWP: half waveplate and PBS: polarizing beam splitter. 



■|0> or |1> 
von Neumann 
measurement 

-D-|0> or |1> 



FIG. 4: Quantum circuit to realize the optimal POVM for the 
accessible information {\wy){w y \} given in Eq. (|17f) . Nomen- 
clature of the gates is same as in Fig. |2] 



experiments j2<| of such measurement were already 
performed by using polarization qubits. We have per- 
formed such experiment again for both polarization and 
location qubits. We also show the logical quantum circuit 
of such POVM. 

The Naimark extension of the POVM {\uy){u y \} given 
in Eq. (|T7|) can be implemented by introducing an ancil- 
lary qubit with that in the initial state |0), and by con- 
structing the orthonormal bases whose projection onto 




HWP 



7Acc 



(a) 



PBS 



n 7Acc 
HWP- HWP^^ 1 

4 4 



■D-PD 

■D-pd 



PD 



(b) 



FIG. 5: Optical implementation of the circuit for the acces- 
sible information in Fig. 0] for (a) polarization qubit and (b) 
location qubit. PD: photodetector. 
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the original plane becomes {|cc^)}. The orthonormal 
bases are then decomposed into the unitary transforma- 
tion and the von Neumann measurement by two qubit 
separable bases. The schematic of this process is shown 
in Fig. 21 where the unitary operation is described by 
the quantum circuit consisting of four controlled unitary 
gates. This one seems to be almost the simplest circuit. 
As discussed in the previous subsection, such quantum 
circuit can be translated into the optical circuits consist- 
ing of linear elements. Figures |S{a) and (b) show the op- 
tical circuits for polarization and location qubit signals, 
respectively. 

The actual experimental setups for the polarization 
and location qubit trines are depicted in Fig. EJa) and 
(b), respectively. In both setups, the left and right of the 
dashed vertical line correspond to the circuits for gener- 
ating and measuring the signals, respectively. The signal 
states are generated by varying the angle of HWP1 with 
(j) x = 2ttx/3 (x — 0,1,2). It should be noted that, for 
practical reason, the original circuits in Fig.J^a) and (b) 
are modified by using the 50:50 beam splitter (BS) in- 
stead of the PBS, and the initial state of the ancillary 
qubit is set to the state |1). 

The experimental procedures and techniques are ba- 
sically the same as those in Ref. 30]. The whole cir- 
cuit in Fig. consists of a polarization Mach-Zehnder 
interferometer and is controlled by a Piezoelectric trans- 
ducer (PZT). The CW light from a He-Ne laser (Spectra- 
Physics, model 117A) operating at the wavelength of 
632.8 nm with 1 mW power is strongly attenuated by 
ND filters with a factor of 5 x 10 -10 such that about 
10 -2 photons exist on average in the whole circuit. The 
attenuated light is purified to the horizontally polarized 
state by a Glan-Thompson prism and then injected to 
the interferometer. The HWP in the encoder (HWP1) is 
driven by a stepping motor to generate the signal state 
{\ip x )p\x = 0,1,2} or {\ip x ) L \x = 0,1,2} sequentially. 
After passing through the circuit, the signal photons are 
guided into the silicon avalanche photodiodes (EG & G, 
SPCM-AQ-141-FC), APDO-2, whose quantum efficiency 
and darkcount are typically 70% and 100 counts/sec, re- 
spectively, through a multimode optical fiber with cou- 
pling efficiency of about 80%. The interferometer is en- 
closed in a darkened box. There are, however, back- 
ground photons amount to about 300 counts/sec, even 
if no laser light is injected. 

The mutual information is evaluated by constructing 
the 3x3 channel matrix [P(y\x) = K^yl^x)! 2 ] from a 
statistical data of single photon events detected by ei- 
ther of the three APDs conditioned on the input state 
\ij)x)' The mutual information thus obtained measures 
the ratio of number of bits retrieved per number of total 
photon counts. This event selection allows us to simu- 
late communications of sending and detecting photons in 
pure state one by one through a noiseless channel even 
when a photon source with random arrival times 
and photodetectors with imperfect efficiencies are used. 
The channel characteristics are then limited only by the 




APD0 



(a) 

\^x)l 




APD0 



(b) 



FIG. 6: Experimental setups for the accessible information 
for (a) polarization qubit and (b) location qubit. Here cj) x = 
2ttx/3 + o ff (x = 0, 1,2) with the offset angle o ff. HWP: 
half waveplate, PBS: polarizing beam splitter, BS: 50:50 beam 
splitter, and PZT: Piezoelectric transducer. 



non-commutativity of the signal states, imperfect align- 
ment of the whole interferometer, deviation from the lock 
points, and the darkcount of the APDs. 

The relative path length of the interferometer is ad- 
justed to be a proper operating point by using a bright 
reference beam and the PZT. The visibility of the inter- 
ferometer better than 98% is obtained. To circumvent 
injecting voltage noise from electronics, we simply used 
a low noise voltage source for adjusting and fixing mir- 
ror positions whereas an electrical feedback system was 
used in Ref. 30]. Once the circuit is adjusted, the ref- 
erence beam is shut off. The signal light is then guided 
into the whole circuit. Photon counts are measured for 
five-second duration. This procedure is repeated for each 
letter state, composing a full sequence of measuring the 
channel matrix. The temporal stability corresponds to 
the change of the relative path length within 3 nm for at 



9 



least more than 200 sec, which causes the error in mutual 
information ±0.005 bits at most. 

Figures [Ja) and (b) show the mutual informations 
measured for the polarization and location qubits, re- 
spectively. The offset angle of the horizontal axis is 
defined by the relative angle (/> ff between the signal 
and measurement state sets (see Fig. [S| and its caption). 
The accessible informations is measured at the zero off- 
set angles for both the polarization and the location 
qubits, and the results are 0.560 ± 0.005 bits/letter and 
0.557 ± 0.007 bits/letter, respectively. The average vis- 
ibilities at these points are evaluated to be 99.16% and 
99.05%, respectively. The difference between the data 
points and the ideal curve is mainly attributed to the 
imperfection of the PBSs. The result for the polariza- 
tion qubit can directly be compared to the previous ex- 
periments [2^, H(j and certifies that the quality of our 
interferometer is improved from those previous results. 

C. Ci limit 

The classical limit of the capacity C\ (= 
0.6454 bits/letter) is obtained by sending only two 
of three letters with 1/2 probabilities and a von 
Neumann measurement in the two-dimensional space 
(Fig. IHb)). The corresponding decoding circuits in the 
polarization and location qubits are shown in Figs. Efa) 
and (b). For the polarization qubit (Fig. Efa))> the 
polarization of the received photon is rotated by the 
HWP with 7r/12 radians, and then is discriminated 
by the PBS followed by APDs. Detection of the 
location qubit is straightforward, i.e. just detecting a 
photon at each optical path. The measured value is 
d = 0.644 ± 0.001 bits/letter, for both the polarization 
and location qubits. 

D. Length two coding 



0.60 




n r .... 1 . ... 1 . ... 1 ... . 1 .... 1 .... i 

-30 -20 -10 10 20 30 

IN 
(a) 



0.60 




-30 -20 -10 10 20 30 

(b) 



FIG. 7: Measured (filled diamonds) and theoretical (solid 
curve) accessible information for (a) the polarization qubit 
signal and (b) the location qubit signal as a function of the 
offset angle o ff. The dashed curve is the mutual information 
obtainable by a standard von Neumann measurement |3(^ . 



The length two code word states \^ xx ) = \^ x ) p®\^ x ) l 
are encoded by the optical circuit of Fig. |3l The angles 
of HWPs are set as 

' = \ arctan 

2 y l+cos <p x 

< 6>i = ± arctan ( ~ si "tf ) , (33) 

1 2 ^ l+cos <j> x I ' V I 

2 = h arctan ( - gin fr ) 

2 V 1 — COS (px J 

where (j) x = 2ttx/3 (x — 0,1,2) corresponds to the en- 
coding parameter in Eq. (|2U|) . 

Figure O shows the optical circuit that corresponds to 
the collective decoding circuit in Fig. [3 It is further 
simplified for practical convenience, and the whole ex- 
perimental setup including encoder is shown in Fig. EH 
It includes a polarization and a normal interferometers 
in Mach-Zehnder arrangements. Each interferometer is 
aligned independently to achieve a visibility better than 



98%. Received code words are decided to be either of 
l^oo), l^ii)? or ^22) according to the reception of the 
photon by APD0, APD1, APD2, respectively. Other pro- 
cedures and conditions are the same as those of the ex- 
periment for the accessible information extraction. 

Figure [H] shows a typical experimental data that corre- 
sponds to the unnormalized value of each element of the 
channel matrix [P(yy\xx)\ Ideally, the ratio of the diago- 
nal and off-diagonal elements must be 0.9714 and 0.0143, 
respectively. The total events counted for 1 sec is of order 
10 6 , while the average count for the off-diagonal elements 
is about 1.9 x 10 4 including the darkcounts of the three 
APDs. The total background count is 2% of the average 
count for the off-diagonal elements. The mutual informa- 
tion is evaluated as I(X 2 : Y 2 ) = 1.312 ±0.005 bits. The 
averaged visibility of the whole system is evaluated to be 
98.48%, which is slightly worse than the result for /acc- 
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FIG. 8: Optical circuits for the C\ detection, (a) Polarization 
qubit, (b) location qubit. 
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FIG. 9: Optical circuit for the collective decoding described 
by the square root measurement. 



This degradation is mainly due to the relative difference 
of the polarization axis between two interferometers. 

For experimental clarity, we measured the variation 
of the mutual information when the code word state 
set {l^cc)} is rotated relative to the decoder state set 
Hn^)} around the vertical axis in Fig. ^c). The rota- 
tion is achieved by adding an offset angle o ff to <j) x in 
Eq. ((55)1 . Figure [El shows the result in which the exper- 
imental data of the collective decoding (filled diamonds) 
are compared to its ideal curve (solid curve). The exper- 
imental and ideal values of C\ and /acc are also shown. 
The difference between the data points and the ideal 
curve is attributed again to the imperfection of the PBSs 
and also to the relative difference of the polarization axis 
between two interferometers. The obtained mutual infor- 
mation, 0.656±0.003 bits/letter, exceeds the theoretical 
limit of the classical capacity C\ — 0.6454 bits/letter. 
Classical length two coding corresponds to the use of po- 
larization and location channels at a time in a separa- 
ble decoding circuit construction, which does not include 
any entangling operation. Then the retrievable informa- 
tion can never exceed 2C\. Our result, therefore, clearly 
shows the experimental evidence of the superadditivity, 
that is, the increase of information more than twice ob- 
tained by inserting an appropriate quantum circuit to 
entangle two letter states. 



VI. QUANTUM-CLASSICAL HYBRID CODING 

The SQCG in small blocks is not only of proof-of- 
principle demonstration but also of practical importance 
in quantum-limited communications. Even two-qubit 
quantum circuit like Fig. El is useful in boosting the per- 
formance of a classical decoder. In this section, using the 
case of the ternary letter state set of Eq. (fT5|) . we show 
how the two-qubit quantum decoder can be combined 
with a classical decoder to improve the total communi- 
cation performance. 

Let us start with a classical channel coding of length n, 
whose schematic is shown in Fig. PHIT a). We assume that 
the letter states are the ternary symmetric states given 
by linear polarizations of a single photon. A code word 
is then physically represented by a sequence of optical 
pulses of single photon, which is a tensor product of each 
letter state. At the receiving side, each optical pulse en- 
ters a photodiode, and is converted into an electric pulse, 
namely, a classical signal. This opto-electric conversion is 
made on each letter state separately. The classical elec- 
tric signals are then processed by an electric circuit (a 
classical decoder) to reconstruct an output message. 

The capacity attained by this classical coding scheme 
is given by the quantity C\. As discussed in Sec. II VI 
C\ (=0.6454 bits/letter) is attained by making the code 
words from only two letter states instead of using all 
three, and by performing the binary measurement. In 
other words, the channel is equivalent to the binary sym- 
metric channel whose error probability is characterized 

by 



P(l|0) 



P(0|1) 

W I 2 ) 



(34) 



The transmission rate for this channel is defined by R c = 
k/n. So if R c < C\ = 0.6454, one must be able to find 
codes for which the decoding error approaches zero in the 
limit n — > oo. 

In practice, however, one wants to know the decod- 
ing error for finite length n. According to the theory 
of reliability function 0, there exists a length n (clas- 
sical) coding attaining P e < 2~ nEr ^ R \ where E r (R) is 
the lower bound of the reliability function. The function 
E r (R) is useful to investigate the maximum communica- 
tion performance of channels for a given code length. 

Let us denote the function E r (R) for the classical 
decoding scheme as E^(R C ). As for the definition of 
E r (R) and the expression of E^(R C ), see Appendix. We 
now consider, as examples, two cases where transmis- 
sion rates are low (R c = 0.1 = 0.15 x Ci) and high 
(R c = 0.62 = 0.96 x Ci). In these cases, we obtain 
£°(0.1) = 5.218 x 10- 4 and £^(0.62) = 0.3150, respec- 
tively, and the asymptotic behaviors of the decoding er- 
ror probabilities are shown in Fig. (the dashed curves). 
This is a typical error performance obtained by averag- 
ing over all possible classical codes. It means that there 
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APDO 



FIG. 10: Experimental setup. HWP: half waveplate, PBS: polarizing beam splitter, BS: 50:50 beam splitter, and PZT: Piezo 
transducer. The HWP angles 60-62 are given in the text. 
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FIG. 11: Histogram of photon counts for the channel matrix 
elements P(yy\xx) = \{H yy \^/ xx )\ 2 (unnormalized). 
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must exist at least one code that exhibits a performance 
superior to that shown in Fig. EI 

Now we turn to the other scheme, in which the above 
classical coding is combined with quantum channel cod- 
ing. We call such a combination of two coding schemes 
as quantum-classical hybrid coding (QCHC). In the fol- 
lowing, we show the power of QCHC by discussing the 
QCHC scheme with the length two quantum coding. Its 
schematic is shown in Fig. lT^T b). Given the length n (as- 
sumed to be an even number), we consider a classical cod- 
ing of length n/2 with the composite letters {00, 11, 22}. 
In decoding, the received code word state is first pro- 
cessed by the two-qubit quantum circuit shown in Fig.0 
and then is detected by two photodiodes. This is the 
quantum collective decoding consisting of the square root 
measurement. The resulting n/2 electric pulses are fi- 
nally processed by a classical decoder. 

The channel model of this scheme is equivalent to a 



FIG. 12: Measured (filled diamonds) and theoretical (solid 
curve) mutual information as a function of the offset angle 
of the code word state set {l^cc)} from the decoder state set 
{|n yy )} around the vertical axis in Fig.QJc). The dotted curve 
is just the guide for eyes. The experimental and theoretical 
Ci are shown by the square and the dashed line, respectively. 
The accessible information /acc experimentally observed for 
polarization and location qubit, and theoretically predicted 
are shown by the open diamond, open circle, and one-dotted 
line, respectively. 

classical coding of length n/2 based on the ternary sym- 
metric channel given by Eq. (|2^|) . The transmission rate 
for this channel is now defined by R^ c = (k/n)\og 2 3 
and it can be raised up to the mutual information of this 
channel, I(X 2 : Y 2 ) = 1.3690. It means that the rate 
k/n can be raised up to I(X 2 : Y 2 )/ log 2 3 = 0.8637. 

Let us denote the function E r (R) for the QCHC as 
E® C (R QC ). The expression of E^ C (R QC ) is also given 
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FIG. 13: (a) Classical channel coding system, (b) Quantum-classical hybrid channel coding system with the letter two quantum 
collective decoding. 




Length n 



FIG. 14: Decoding error probabilities with finite code length 
for the QCHC (solid lines) and the all classical coding (dashed 
lines) . 



in Appendix. To compare the performance of Efi c (R® ) 
with that of E^(R C ), the rate k/n should be fixed. For 
the same rates as before, k/n = 0.1 and k/n = 0.62, 
the transmission rates are given by R® c = 0.1585 and 
R QC = 0.9827, respectively, The function E? C (R QC ) at 
these rates are evaluated to be E^ c (0.1585) = 0.8415 
and £^ c (0.9827) = 9.753 x 10~ 2 , respectively. The 
upper bound of the error probability is then given by 



P e = 2 ( n / 2 ) E ? c . The error probabilities for these exam- 
ples are shown by the solid curves in Fig. EI 

As seen in this figure, at the rate of k/n = 0.62, the 
decoding error starts to decrease rapidly over n ~ 10000 
in the classical coding scheme. In the QCHC scheme, 
the error starts to decrease from n ~ 100, and reaches 
the standard error-free criterion 10 -9 around n ~ 600 
(300 composite letter pairs). With this code length, it 
is impossible to transmit any information reliably by the 
classical coding scheme |42j. This improvement can be 
achieved just by inserting the two-qubit quantum decoder 
in front of the classical decoder. To achieve the standard 
error-free criterion by classical coding, one must use code 
words of length n ~ 57300. 

As codes get longer, the complexity of the decoder, 
such as the total number of arithmetic operations x(n), 
increases and eventually limits the effective transmission 
speed. The total decoding time per letter by the device 
with finite speed To sec/step is given by ^^to sec/letter. 
Presumably, this is a limiting factor and thus we can de- 
fine the effective transmission speed as R e ff = R X ^ TQ 
bits/sec. For some asymptotically good codes, the to- 
tal number of arithmetic operations is typically of or- 
der x(n) = 0((n log n) 2 ) 43]. Therefore, the effective 
transmission speed behaves as R e fi ex n ^J^ n y • Then the 
reduction of code length brought by QCHC will be prac- 
tically significant in the trade-off between performance 
and complexity. In our example, even in the lower rate, 
the decoding error of the QCHC around the standard er- 
ror free criterion is two figures smaller than that of the 
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classical coding with the same code length. 



VII. CONCLUDING REMARKS 

In this paper, we have detailed the experimental 
demonstration of the superadditive quantum coding gain 
(SQCG). The superadditive quantum coding becomes es- 
sential in the region where the noncommutativity of sig- 
nal states is the main ambiguity among signals. As a typ- 
ical example, we have mentioned deep space optical com- 
munications in Sec. [I] where we have to extract as much 
information as possible from the sequences of heavily at- 
tenuated signals. Apparently, it is not realistic in such 
a situation to transmit nonclassical states or to install 
quantum repeaters. Therefore, only quantum decoding 
can be the core technology to achieve the ultimate per- 
formance in future long-haul optical communications. 

Quantum-classical hybrid coding (QCHC) is then a 
promising approach. We have shown in Section IVT1 that 
even a small scale quantum computing can enhance the 
effective transmission speed when it is used together with 
large scale classical coding. Thus the QCHC allows one 
to extend conventional optical communications technol- 
ogy to the quantum limit in a straightforward way. This 
may be contrasted to the fact that other known quan- 
tum algorithms exhibit the advantage over their classical 
counterparts only when a large scale quantum computer 
is available. For the QCHC applications, the communica- 
tion performance of QCHC gets better as available scale 
of quantum computer gets larger. 

Finally, we mention the challenges to be overcome to 
bring a QCHC system into reality. Although our result 
clearly demonstrated the principle of SQCG, the phys- 
ical scheme used in our experiment is still not suitable 
for real applications, i.e. is not applicable to weak coher- 
ent signals. To implement a quantum collective decoder 
for weak coherent signals, one must be able to entangle 
weak coherent optical pulses with respect to the degrees 
of freedom of phase and/or amplitude. Heavily attenu- 
ated coherent signals can be approximated by superposi- 
tions of zero and one photon states as \ctk) ~ |0) 
It means that the proposals of qubit gating may be ap- 
plicable to our problem. 

Quantum gating operation at single photon level has 
been investigated by many authors, but it still remains 
as a challenging topic. One scenario suggested in Ref. Q 
is to transfer the information in optical field to a multi- 
level single atom inside a high-finesse optical cavity in 
order to perform gating operations by Raman process. 
Another possible way is to use atomic system as a nonlin- 
ear medium with the idea of electromagnetically induced 
transparency 4j| . Finally, a recent proposal suggests the 
possibility of an optical quantum circuit based on linear 
optics |45| . As single photon on-demand sources [H and 
highly efficient photon detectors become available, small 
scale quantum gating circuits for coherent state signals 
can be realized, in principle, with only linear optics. 



From the viewpoint of coding theory, on the other 
hand, it is still open to find asymptotically good quantum 
channel codes. In the case of pure state channels, Ref. 
tells us that the problem is essentially the selection 
of appropriate sequences for code words, and the square 
root measurement does the decoding. It is also important 
to establish a systematic theory to synthesize a quantum 
circuit for intermediate scale collective decoding. 



APPENDIX: LOWER BOUND OF THE 
RELIABILITY FUNCTION 

In this Appendix, we give the definition of reliability 
function and its lower bound, E r (R). The latter one is 
applied to the channels describing the classical coding 
and the QCHC with the length two quantum coding, 
respectively, for the qubit trine signal. 

The reliability function is defined as Q, 



E(R) = lim sup ■ 



• In P e (n,R) 



n 



(A.l) 



where P e (n, R) is the minimum error probability over all 
(n, R) codes. Although the true E(R) for any R has not 
been clarified yet, it is known that its lower bound is 
given by 



E r (R) — max max [Eo(p) 

P {P(x)} 



where 



pR] , (A.2) 



(i+p) 



E (p, P(x)) = -log£ (j2P(x)P(y\x)^ 1+ A 

y \ x / 

(A.3) 

with < p < 1. The function E r (R) yields the upper 
bound of an average error probability, P e , for the code 
with given n and R by P e < 2~ nEr ( R \ We also note that, 
for any symmetric channels, E r (R) is maximized when 
the all signals are given by the equal prior probability 
distribution @|. 

As discussed in Sec. IVII the channel matrix attaining 
the capacity of the classical coding for the qubit trine 
signal is given by the binary symmetric channel with the 
channel matrix 



[P(y\x)} 



1 



(A.4) 



where e is given in Eq. The analytic expression of 

E r (R) for this channel can easily be derived. First we 
define the quantities, 



and 



e V(i+p) 

6p = e i/(i+p) + (l_ e )i/(i+p) : 
i2o = l-fT(ei), 



(A.5) 



(A.6) 
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where 

H(e p ) = -e p log 2 e p - (1 - e p ) log 2 (l - e p ). (A.7) 

After maximizing £^o(p, ^(^)) over P{x) and p, we obtain 
that if R < i? , 



£y (#) = 1 - 21og 2 (y^ + Vl-e) - i?, 
and if R < R < Ci, 

E?(R) = e p * log 2 "-f + (1 - e p *) log 2 

where p* is the solution of 

R=l- H(e p ). 



(A, 



, (A.9) 



(A.10) 



and 

i2o=log 2 3-ri-iT(ri). (A.12) 
Then if R < R , 

E? G (R) = log 2 3 - 21og 2 (cos 1 + V2sin |) , (A.13) 
andifi? <R<I(X 2 : Y 2 ), 



e? c (r) = r> io g2 + (i - r> ) io g2 1 r " 



2 7 

sin I 



cos 



The channel matrix for the QCHC discussed in Sec. I VII 
is given in Eq. (|24[). which is a ternary symmetric channel. where p* is the solution of 
The expression of E r (R) for a ternary symmetric channel 
can also be derived with the quantities, 



(sin 2 |) 



2 7 xl/(l+p) 



R = \og 2 3-T p -H(T p ). 



2 7 ' 

(A.14) 



(A.15) 



I(2cos^) 1/(1 ^ + (sin 2 i) 



2 7 a/(i+p) ; 



(A.ll) 
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